Dataset statistics
| Number of variables | 31 |
|---|---|
| Number of observations | 579958 |
| Missing cells | 2981327 |
| Missing cells (%) | 16.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 137.2 MiB |
| Average record size in memory | 248.0 B |
Variable types
| Numeric | 20 |
|---|---|
| DateTime | 1 |
| Categorical | 4 |
| Text | 6 |
CANCELLED is highly imbalanced (94.9%) | Imbalance |
DIVERTED is highly imbalanced (97.8%) | Imbalance |
CANCELLATION_CODE has 576648 (99.4%) missing values | Missing |
CARRIER_DELAY has 475839 (82.0%) missing values | Missing |
WEATHER_DELAY has 475839 (82.0%) missing values | Missing |
NAS_DELAY has 475839 (82.0%) missing values | Missing |
SECURITY_DELAY has 475839 (82.0%) missing values | Missing |
LATE_AIRCRAFT_DELAY has 475839 (82.0%) missing values | Missing |
WEATHER_DELAY is highly skewed (γ1 = 22.11241594) | Skewed |
SECURITY_DELAY is highly skewed (γ1 = 151.3139368) | Skewed |
DEP_DELAY has 28184 (4.9%) zeros | Zeros |
ARR_DELAY has 10858 (1.9%) zeros | Zeros |
CARRIER_DELAY has 41858 (7.2%) zeros | Zeros |
WEATHER_DELAY has 99571 (17.2%) zeros | Zeros |
NAS_DELAY has 57455 (9.9%) zeros | Zeros |
SECURITY_DELAY has 103517 (17.8%) zeros | Zeros |
LATE_AIRCRAFT_DELAY has 50580 (8.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-30 05:49:34.144341 |
|---|---|
| Analysis finished | 2024-03-30 05:52:27.154448 |
| Duration | 2 minutes and 53.01 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
DAY_OF_WEEK
Real number (ℝ)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7665296 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.0019356 |
|---|---|
| Coefficient of variation (CV) | 0.53150666 |
| Kurtosis | -1.2111724 |
| Mean | 3.7665296 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.17139827 |
| Sum | 2184429 |
| Variance | 4.0077461 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1 | 97754 | |
| 3 | 93480 | |
| 2 | 91668 | |
| 5 | 78217 | |
| 4 | 78087 | |
| 7 | 74954 | |
| 6 | 65798 |
| Value | Count | Frequency (%) |
| 1 | 97754 | |
| 2 | 91668 | |
| 3 | 93480 | |
| 4 | 78087 | |
| 5 | 78217 | |
| 6 | 65798 | |
| 7 | 74954 |
| Value | Count | Frequency (%) |
| 7 | 74954 | |
| 6 | 65798 | |
| 5 | 78217 | |
| 4 | 78087 | |
| 3 | 93480 | |
| 2 | 91668 | |
| 1 | 97754 |
FL_DATE
Date
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
| Minimum | 2023-05-01 00:00:00 |
|---|---|
| Maximum | 2023-05-31 00:00:00 |
OP_UNIQUE_CARRIER
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
| WN | |
|---|---|
| DL | |
| AA | |
| UA | |
| OO | |
| Other values (10) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1159916 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 9E |
|---|---|
| 2nd row | 9E |
| 3rd row | 9E |
| 4th row | 9E |
| 5th row | 9E |
Common Values
| Value | Count | Frequency (%) |
| WN | 122521 | |
| DL | 84459 | |
| AA | 79782 | |
| UA | 62340 | |
| OO | 56397 | |
| YX | 26540 | 4.6% |
| B6 | 24639 | 4.2% |
| NK | 22506 | 3.9% |
| AS | 20641 | 3.6% |
| MQ | 17641 | 3.0% |
| Other values (5) | 62492 |
Length
| Value | Count | Frequency (%) |
| wn | 122521 | |
| dl | 84459 | |
| aa | 79782 | |
| ua | 62340 | |
| oo | 56397 | |
| yx | 26540 | 4.6% |
| b6 | 24639 | 4.2% |
| nk | 22506 | 3.9% |
| as | 20641 | 3.6% |
| mq | 17641 | 3.0% |
| Other values (5) | 62492 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 249439 | |
| N | 145027 | |
| O | 128843 | |
| W | 122521 | |
| D | 84459 | 7.3% |
| L | 84459 | 7.3% |
| U | 62340 | 5.4% |
| 9 | 30127 | 2.6% |
| Y | 26540 | 2.3% |
| X | 26540 | 2.3% |
| Other values (11) | 199621 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1159916 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 249439 | |
| N | 145027 | |
| O | 128843 | |
| W | 122521 | |
| D | 84459 | 7.3% |
| L | 84459 | 7.3% |
| U | 62340 | 5.4% |
| 9 | 30127 | 2.6% |
| Y | 26540 | 2.3% |
| X | 26540 | 2.3% |
| Other values (11) | 199621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1159916 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 249439 | |
| N | 145027 | |
| O | 128843 | |
| W | 122521 | |
| D | 84459 | 7.3% |
| L | 84459 | 7.3% |
| U | 62340 | 5.4% |
| 9 | 30127 | 2.6% |
| Y | 26540 | 2.3% |
| X | 26540 | 2.3% |
| Other values (11) | 199621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1159916 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 249439 | |
| N | 145027 | |
| O | 128843 | |
| W | 122521 | |
| D | 84459 | 7.3% |
| L | 84459 | 7.3% |
| U | 62340 | 5.4% |
| 9 | 30127 | 2.6% |
| Y | 26540 | 2.3% |
| X | 26540 | 2.3% |
| Other values (11) | 199621 |
OP_CARRIER_FL_NUM
Real number (ℝ)
| Distinct | 5906 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2276.8748 |
| Minimum | 1 |
|---|---|
| Maximum | 8815 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 286 |
| Q1 | 1044 |
| median | 2036 |
| Q3 | 3289 |
| 95-th percentile | 5345 |
| Maximum | 8815 |
| Range | 8814 |
| Interquartile range (IQR) | 2245 |
Descriptive statistics
| Standard deviation | 1549.9056 |
|---|---|
| Coefficient of variation (CV) | 0.6807162 |
| Kurtosis | -0.51495536 |
| Mean | 2276.8748 |
| Median Absolute Deviation (MAD) | 1062 |
| Skewness | 0.63462089 |
| Sum | 1.3204918 × 109 |
| Variance | 2402207.2 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 538 | 385 | 0.1% |
| 533 | 343 | 0.1% |
| 777 | 335 | 0.1% |
| 1073 | 332 | 0.1% |
| 341 | 314 | 0.1% |
| 354 | 307 | 0.1% |
| 374 | 301 | 0.1% |
| 2095 | 296 | 0.1% |
| 1355 | 291 | 0.1% |
| 201 | 289 | < 0.1% |
| Other values (5896) | 576765 |
| Value | Count | Frequency (%) |
| 1 | 142 | |
| 2 | 176 | |
| 3 | 130 | |
| 4 | 198 | |
| 5 | 90 | |
| 6 | 87 | |
| 7 | 95 | |
| 8 | 70 | < 0.1% |
| 9 | 179 | |
| 10 | 196 |
| Value | Count | Frequency (%) |
| 8815 | 4 | |
| 8811 | 1 | < 0.1% |
| 8801 | 2 | |
| 8800 | 4 | |
| 8799 | 1 | < 0.1% |
| 8790 | 1 | < 0.1% |
| 8789 | 1 | < 0.1% |
| 8788 | 2 | |
| 8787 | 2 | |
| 8786 | 2 |
ORIGIN_AIRPORT_ID
Real number (ℝ)
| Distinct | 342 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12651.597 |
| Minimum | 10135 |
|---|---|
| Maximum | 16869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 10135 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 11292 |
| median | 12889 |
| Q3 | 14027 |
| 95-th percentile | 14893 |
| Maximum | 16869 |
| Range | 6734 |
| Interquartile range (IQR) | 2735 |
Descriptive statistics
| Standard deviation | 1527.9889 |
|---|---|
| Coefficient of variation (CV) | 0.12077439 |
| Kurtosis | -1.2972769 |
| Mean | 12651.597 |
| Median Absolute Deviation (MAD) | 1591 |
| Skewness | 0.10392936 |
| Sum | 7.337395 × 109 |
| Variance | 2334750.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10397 | 28740 | 5.0% |
| 11292 | 23997 | 4.1% |
| 11298 | 23900 | 4.1% |
| 13930 | 21932 | 3.8% |
| 12892 | 16736 | 2.9% |
| 12889 | 16205 | 2.8% |
| 11057 | 16121 | 2.8% |
| 14107 | 14621 | 2.5% |
| 12953 | 14517 | 2.5% |
| 13204 | 13983 | 2.4% |
| Other values (332) | 389206 |
| Value | Count | Frequency (%) |
| 10135 | 351 | 0.1% |
| 10136 | 93 | < 0.1% |
| 10140 | 2039 | |
| 10141 | 62 | < 0.1% |
| 10146 | 62 | < 0.1% |
| 10154 | 100 | < 0.1% |
| 10155 | 91 | < 0.1% |
| 10157 | 142 | < 0.1% |
| 10158 | 229 | < 0.1% |
| 10165 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 16869 | 141 | < 0.1% |
| 16218 | 124 | < 0.1% |
| 15991 | 62 | < 0.1% |
| 15919 | 980 | |
| 15897 | 29 | < 0.1% |
| 15841 | 62 | < 0.1% |
| 15624 | 837 | |
| 15607 | 62 | < 0.1% |
| 15582 | 53 | < 0.1% |
| 15569 | 53 | < 0.1% |
ORIGIN
Text
| Distinct | 342 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1739874 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ROC |
|---|---|
| 2nd row | ITH |
| 3rd row | CLE |
| 4th row | IAD |
| 5th row | JFK |
| Value | Count | Frequency (%) |
| atl | 28740 | 5.0% |
| den | 23997 | 4.1% |
| dfw | 23900 | 4.1% |
| ord | 21932 | 3.8% |
| lax | 16736 | 2.9% |
| las | 16205 | 2.8% |
| clt | 16121 | 2.8% |
| phx | 14621 | 2.5% |
| lga | 14517 | 2.5% |
| mco | 13983 | 2.4% |
| Other values (332) | 389206 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 199031 | 11.4% |
| L | 159705 | 9.2% |
| S | 149520 | 8.6% |
| D | 136291 | 7.8% |
| T | 91277 | 5.2% |
| O | 89843 | 5.2% |
| C | 87132 | 5.0% |
| M | 77789 | 4.5% |
| F | 72666 | 4.2% |
| W | 68452 | 3.9% |
| Other values (16) | 608168 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 199031 | 11.4% |
| L | 159705 | 9.2% |
| S | 149520 | 8.6% |
| D | 136291 | 7.8% |
| T | 91277 | 5.2% |
| O | 89843 | 5.2% |
| C | 87132 | 5.0% |
| M | 77789 | 4.5% |
| F | 72666 | 4.2% |
| W | 68452 | 3.9% |
| Other values (16) | 608168 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 199031 | 11.4% |
| L | 159705 | 9.2% |
| S | 149520 | 8.6% |
| D | 136291 | 7.8% |
| T | 91277 | 5.2% |
| O | 89843 | 5.2% |
| C | 87132 | 5.0% |
| M | 77789 | 4.5% |
| F | 72666 | 4.2% |
| W | 68452 | 3.9% |
| Other values (16) | 608168 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 199031 | 11.4% |
| L | 159705 | 9.2% |
| S | 149520 | 8.6% |
| D | 136291 | 7.8% |
| T | 91277 | 5.2% |
| O | 89843 | 5.2% |
| C | 87132 | 5.0% |
| M | 77789 | 4.5% |
| F | 72666 | 4.2% |
| W | 68452 | 3.9% |
| Other values (16) | 608168 |
ORIGIN_CITY_NAME
Text
| Distinct | 336 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.034909 |
| Min length | 8 |
Characters and Unicode
| Total characters | 7559700 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rochester, NY |
|---|---|
| 2nd row | Ithaca/Cortland, NY |
| 3rd row | Cleveland, OH |
| 4th row | Washington, DC |
| 5th row | New York, NY |
| Value | Count | Frequency (%) |
| ca | 63479 | 4.7% |
| tx | 60293 | 4.5% |
| fl | 51214 | 3.8% |
| ny | 33161 | 2.5% |
| san | 30927 | 2.3% |
| ga | 30911 | 2.3% |
| new | 30880 | 2.3% |
| il | 30381 | 2.2% |
| chicago | 29268 | 2.2% |
| atlanta | 28740 | 2.1% |
| Other values (408) | 963452 |
Most occurring characters
| Value | Count | Frequency (%) |
| 772748 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577216 | 7.6% |
| o | 417203 | 5.5% |
| e | 398416 | 5.3% |
| n | 371503 | 4.9% |
| t | 359175 | 4.8% |
| l | 331223 | 4.4% |
| i | 285708 | 3.8% |
| r | 274691 | 3.6% |
| Other values (47) | 3191859 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7559700 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 772748 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577216 | 7.6% |
| o | 417203 | 5.5% |
| e | 398416 | 5.3% |
| n | 371503 | 4.9% |
| t | 359175 | 4.8% |
| l | 331223 | 4.4% |
| i | 285708 | 3.8% |
| r | 274691 | 3.6% |
| Other values (47) | 3191859 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7559700 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 772748 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577216 | 7.6% |
| o | 417203 | 5.5% |
| e | 398416 | 5.3% |
| n | 371503 | 4.9% |
| t | 359175 | 4.8% |
| l | 331223 | 4.4% |
| i | 285708 | 3.8% |
| r | 274691 | 3.6% |
| Other values (47) | 3191859 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7559700 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 772748 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577216 | 7.6% |
| o | 417203 | 5.5% |
| e | 398416 | 5.3% |
| n | 371503 | 4.9% |
| t | 359175 | 4.8% |
| l | 331223 | 4.4% |
| i | 285708 | 3.8% |
| r | 274691 | 3.6% |
| Other values (47) | 3191859 |
ORIGIN_STATE_NM
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 14 |
| Mean length | 8.1701554 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4738347 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York |
|---|---|
| 2nd row | New York |
| 3rd row | Ohio |
| 4th row | Virginia |
| 5th row | New York |
| Value | Count | Frequency (%) |
| california | 63479 | 9.5% |
| texas | 60293 | 9.0% |
| florida | 51214 | 7.7% |
| new | 48776 | 7.3% |
| york | 33161 | 5.0% |
| georgia | 30911 | 4.6% |
| illinois | 30381 | 4.6% |
| carolina | 29367 | 4.4% |
| colorado | 26056 | 3.9% |
| north | 25213 | 3.8% |
| Other values (51) | 267897 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 633376 | |
| i | 533340 | 11.3% |
| o | 451285 | 9.5% |
| n | 347472 | 7.3% |
| r | 344588 | 7.3% |
| e | 294096 | 6.2% |
| s | 270808 | 5.7% |
| l | 261741 | 5.5% |
| C | 120764 | 2.5% |
| d | 113454 | 2.4% |
| Other values (37) | 1367423 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4738347 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 633376 | |
| i | 533340 | 11.3% |
| o | 451285 | 9.5% |
| n | 347472 | 7.3% |
| r | 344588 | 7.3% |
| e | 294096 | 6.2% |
| s | 270808 | 5.7% |
| l | 261741 | 5.5% |
| C | 120764 | 2.5% |
| d | 113454 | 2.4% |
| Other values (37) | 1367423 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4738347 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 633376 | |
| i | 533340 | 11.3% |
| o | 451285 | 9.5% |
| n | 347472 | 7.3% |
| r | 344588 | 7.3% |
| e | 294096 | 6.2% |
| s | 270808 | 5.7% |
| l | 261741 | 5.5% |
| C | 120764 | 2.5% |
| d | 113454 | 2.4% |
| Other values (37) | 1367423 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4738347 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 633376 | |
| i | 533340 | 11.3% |
| o | 451285 | 9.5% |
| n | 347472 | 7.3% |
| r | 344588 | 7.3% |
| e | 294096 | 6.2% |
| s | 270808 | 5.7% |
| l | 261741 | 5.5% |
| C | 120764 | 2.5% |
| d | 113454 | 2.4% |
| Other values (37) | 1367423 |
ORIGIN_WAC
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.306536 |
| Minimum | 1 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 33 |
| median | 44 |
| Q3 | 82 |
| 95-th percentile | 91 |
| Maximum | 93 |
| Range | 92 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 26.86296 |
|---|---|
| Coefficient of variation (CV) | 0.49465427 |
| Kurtosis | -1.3219643 |
| Mean | 54.306536 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | -0.013525889 |
| Sum | 31495510 |
| Variance | 721.61862 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 91 | 63479 | 10.9% |
| 74 | 60293 | 10.4% |
| 33 | 51214 | 8.8% |
| 22 | 33161 | 5.7% |
| 34 | 30911 | 5.3% |
| 41 | 30381 | 5.2% |
| 82 | 26056 | 4.5% |
| 36 | 23857 | 4.1% |
| 38 | 20414 | 3.5% |
| 85 | 17797 | 3.1% |
| Other values (42) | 222395 |
| Value | Count | Frequency (%) |
| 1 | 3044 | 0.5% |
| 2 | 11251 | |
| 3 | 3306 | 0.6% |
| 4 | 449 | 0.1% |
| 5 | 105 | < 0.1% |
| 11 | 1862 | 0.3% |
| 12 | 1238 | 0.2% |
| 13 | 12744 | |
| 14 | 574 | 0.1% |
| 15 | 1285 | 0.2% |
| Value | Count | Frequency (%) |
| 93 | 15970 | 2.8% |
| 92 | 6670 | 1.2% |
| 91 | 63479 | |
| 88 | 577 | 0.1% |
| 87 | 9607 | 1.7% |
| 86 | 2259 | 0.4% |
| 85 | 17797 | 3.1% |
| 84 | 1898 | 0.3% |
| 83 | 2249 | 0.4% |
| 82 | 26056 |
DEST_AIRPORT_ID
Real number (ℝ)
| Distinct | 342 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12651.522 |
| Minimum | 10135 |
|---|---|
| Maximum | 16869 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 10135 |
|---|---|
| 5-th percentile | 10397 |
| Q1 | 11292 |
| median | 12889 |
| Q3 | 14027 |
| 95-th percentile | 14893 |
| Maximum | 16869 |
| Range | 6734 |
| Interquartile range (IQR) | 2735 |
Descriptive statistics
| Standard deviation | 1528 |
|---|---|
| Coefficient of variation (CV) | 0.12077598 |
| Kurtosis | -1.2973229 |
| Mean | 12651.522 |
| Median Absolute Deviation (MAD) | 1591 |
| Skewness | 0.10400847 |
| Sum | 7.3373511 × 109 |
| Variance | 2334783.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10397 | 28741 | 5.0% |
| 11292 | 23988 | 4.1% |
| 11298 | 23926 | 4.1% |
| 13930 | 21941 | 3.8% |
| 12892 | 16734 | 2.9% |
| 12889 | 16196 | 2.8% |
| 11057 | 16141 | 2.8% |
| 14107 | 14620 | 2.5% |
| 12953 | 14514 | 2.5% |
| 13204 | 13981 | 2.4% |
| Other values (332) | 389176 |
| Value | Count | Frequency (%) |
| 10135 | 351 | 0.1% |
| 10136 | 93 | < 0.1% |
| 10140 | 2038 | |
| 10141 | 62 | < 0.1% |
| 10146 | 62 | < 0.1% |
| 10154 | 100 | < 0.1% |
| 10155 | 91 | < 0.1% |
| 10157 | 142 | < 0.1% |
| 10158 | 229 | < 0.1% |
| 10165 | 9 | < 0.1% |
| Value | Count | Frequency (%) |
| 16869 | 141 | < 0.1% |
| 16218 | 124 | < 0.1% |
| 15991 | 62 | < 0.1% |
| 15919 | 982 | |
| 15897 | 29 | < 0.1% |
| 15841 | 62 | < 0.1% |
| 15624 | 837 | |
| 15607 | 62 | < 0.1% |
| 15582 | 53 | < 0.1% |
| 15569 | 53 | < 0.1% |
DEST
Text
| Distinct | 342 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1739874 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LGA |
|---|---|
| 2nd row | JFK |
| 3rd row | JFK |
| 4th row | LGA |
| 5th row | CLE |
| Value | Count | Frequency (%) |
| atl | 28741 | 5.0% |
| den | 23988 | 4.1% |
| dfw | 23926 | 4.1% |
| ord | 21941 | 3.8% |
| lax | 16734 | 2.9% |
| las | 16196 | 2.8% |
| clt | 16141 | 2.8% |
| phx | 14620 | 2.5% |
| lga | 14514 | 2.5% |
| mco | 13981 | 2.4% |
| Other values (332) | 389176 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 198994 | 11.4% |
| L | 159700 | 9.2% |
| S | 149511 | 8.6% |
| D | 136309 | 7.8% |
| T | 91292 | 5.2% |
| O | 89854 | 5.2% |
| C | 87153 | 5.0% |
| M | 77779 | 4.5% |
| F | 72687 | 4.2% |
| W | 68468 | 3.9% |
| Other values (16) | 608127 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 198994 | 11.4% |
| L | 159700 | 9.2% |
| S | 149511 | 8.6% |
| D | 136309 | 7.8% |
| T | 91292 | 5.2% |
| O | 89854 | 5.2% |
| C | 87153 | 5.0% |
| M | 77779 | 4.5% |
| F | 72687 | 4.2% |
| W | 68468 | 3.9% |
| Other values (16) | 608127 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 198994 | 11.4% |
| L | 159700 | 9.2% |
| S | 149511 | 8.6% |
| D | 136309 | 7.8% |
| T | 91292 | 5.2% |
| O | 89854 | 5.2% |
| C | 87153 | 5.0% |
| M | 77779 | 4.5% |
| F | 72687 | 4.2% |
| W | 68468 | 3.9% |
| Other values (16) | 608127 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 198994 | 11.4% |
| L | 159700 | 9.2% |
| S | 149511 | 8.6% |
| D | 136309 | 7.8% |
| T | 91292 | 5.2% |
| O | 89854 | 5.2% |
| C | 87153 | 5.0% |
| M | 77779 | 4.5% |
| F | 72687 | 4.2% |
| W | 68468 | 3.9% |
| Other values (16) | 608127 |
DEST_CITY_NAME
Text
| Distinct | 336 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 13.035001 |
| Min length | 8 |
Characters and Unicode
| Total characters | 7559753 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York, NY |
|---|---|
| 2nd row | New York, NY |
| 3rd row | New York, NY |
| 4th row | New York, NY |
| 5th row | Cleveland, OH |
| Value | Count | Frequency (%) |
| ca | 63472 | 4.7% |
| tx | 60319 | 4.5% |
| fl | 51187 | 3.8% |
| ny | 33161 | 2.5% |
| san | 30930 | 2.3% |
| ga | 30908 | 2.3% |
| new | 30879 | 2.3% |
| il | 30388 | 2.2% |
| chicago | 29275 | 2.2% |
| atlanta | 28741 | 2.1% |
| Other values (408) | 963431 |
Most occurring characters
| Value | Count | Frequency (%) |
| 772733 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577214 | 7.6% |
| o | 417264 | 5.5% |
| e | 398338 | 5.3% |
| n | 371504 | 4.9% |
| t | 359253 | 4.8% |
| l | 331260 | 4.4% |
| i | 285723 | 3.8% |
| r | 274704 | 3.6% |
| Other values (47) | 3191802 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7559753 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 772733 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577214 | 7.6% |
| o | 417264 | 5.5% |
| e | 398338 | 5.3% |
| n | 371504 | 4.9% |
| t | 359253 | 4.8% |
| l | 331260 | 4.4% |
| i | 285723 | 3.8% |
| r | 274704 | 3.6% |
| Other values (47) | 3191802 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7559753 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 772733 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577214 | 7.6% |
| o | 417264 | 5.5% |
| e | 398338 | 5.3% |
| n | 371504 | 4.9% |
| t | 359253 | 4.8% |
| l | 331260 | 4.4% |
| i | 285723 | 3.8% |
| r | 274704 | 3.6% |
| Other values (47) | 3191802 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7559753 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 772733 | 10.2% | |
| , | 579958 | 7.7% |
| a | 577214 | 7.6% |
| o | 417264 | 5.5% |
| e | 398338 | 5.3% |
| n | 371504 | 4.9% |
| t | 359253 | 4.8% |
| l | 331260 | 4.4% |
| i | 285723 | 3.8% |
| r | 274704 | 3.6% |
| Other values (47) | 3191802 |
DEST_STATE_NM
Text
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 14 |
| Mean length | 8.1702606 |
| Min length | 4 |
Characters and Unicode
| Total characters | 4738408 |
|---|---|
| Distinct characters | 47 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York |
|---|---|
| 2nd row | New York |
| 3rd row | New York |
| 4th row | New York |
| 5th row | Ohio |
| Value | Count | Frequency (%) |
| california | 63472 | 9.5% |
| texas | 60319 | 9.0% |
| florida | 51187 | 7.7% |
| new | 48773 | 7.3% |
| york | 33161 | 5.0% |
| georgia | 30908 | 4.6% |
| illinois | 30388 | 4.6% |
| carolina | 29379 | 4.4% |
| colorado | 26047 | 3.9% |
| north | 25231 | 3.8% |
| Other values (51) | 267898 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 633361 | |
| i | 533332 | 11.3% |
| o | 451253 | 9.5% |
| n | 347474 | 7.3% |
| r | 344591 | 7.3% |
| e | 294107 | 6.2% |
| s | 270873 | 5.7% |
| l | 261733 | 5.5% |
| C | 120756 | 2.5% |
| d | 113409 | 2.4% |
| Other values (37) | 1367519 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4738408 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 633361 | |
| i | 533332 | 11.3% |
| o | 451253 | 9.5% |
| n | 347474 | 7.3% |
| r | 344591 | 7.3% |
| e | 294107 | 6.2% |
| s | 270873 | 5.7% |
| l | 261733 | 5.5% |
| C | 120756 | 2.5% |
| d | 113409 | 2.4% |
| Other values (37) | 1367519 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4738408 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 633361 | |
| i | 533332 | 11.3% |
| o | 451253 | 9.5% |
| n | 347474 | 7.3% |
| r | 344591 | 7.3% |
| e | 294107 | 6.2% |
| s | 270873 | 5.7% |
| l | 261733 | 5.5% |
| C | 120756 | 2.5% |
| d | 113409 | 2.4% |
| Other values (37) | 1367519 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4738408 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 633361 | |
| i | 533332 | 11.3% |
| o | 451253 | 9.5% |
| n | 347474 | 7.3% |
| r | 344591 | 7.3% |
| e | 294107 | 6.2% |
| s | 270873 | 5.7% |
| l | 261733 | 5.5% |
| C | 120756 | 2.5% |
| d | 113409 | 2.4% |
| Other values (37) | 1367519 |
DEST_WAC
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 54.304236 |
| Minimum | 1 |
|---|---|
| Maximum | 93 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 33 |
| median | 44 |
| Q3 | 82 |
| 95-th percentile | 91 |
| Maximum | 93 |
| Range | 92 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 26.863506 |
|---|---|
| Coefficient of variation (CV) | 0.49468527 |
| Kurtosis | -1.321858 |
| Mean | 54.304236 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | -0.013531847 |
| Sum | 31494176 |
| Variance | 721.64793 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 91 | 63472 | 10.9% |
| 74 | 60319 | 10.4% |
| 33 | 51187 | 8.8% |
| 22 | 33161 | 5.7% |
| 34 | 30908 | 5.3% |
| 41 | 30388 | 5.2% |
| 82 | 26047 | 4.5% |
| 36 | 23876 | 4.1% |
| 38 | 20417 | 3.5% |
| 85 | 17787 | 3.1% |
| Other values (42) | 222396 |
| Value | Count | Frequency (%) |
| 1 | 3049 | 0.5% |
| 2 | 11251 | |
| 3 | 3314 | 0.6% |
| 4 | 449 | 0.1% |
| 5 | 105 | < 0.1% |
| 11 | 1858 | 0.3% |
| 12 | 1238 | 0.2% |
| 13 | 12752 | |
| 14 | 573 | 0.1% |
| 15 | 1285 | 0.2% |
| Value | Count | Frequency (%) |
| 93 | 15965 | 2.8% |
| 92 | 6673 | 1.2% |
| 91 | 63472 | |
| 88 | 576 | 0.1% |
| 87 | 9609 | 1.7% |
| 86 | 2257 | 0.4% |
| 85 | 17787 | 3.1% |
| 84 | 1897 | 0.3% |
| 83 | 2248 | 0.4% |
| 82 | 26047 |
CRS_DEP_TIME
Real number (ℝ)
| Distinct | 1217 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1331.1444 |
| Minimum | 1 |
|---|---|
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 600 |
| Q1 | 906 |
| median | 1320 |
| Q3 | 1743 |
| 95-th percentile | 2135 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 837 |
Descriptive statistics
| Standard deviation | 497.72232 |
|---|---|
| Coefficient of variation (CV) | 0.37390559 |
| Kurtosis | -1.0834288 |
| Mean | 1331.1444 |
| Median Absolute Deviation (MAD) | 418 |
| Skewness | 0.088311117 |
| Sum | 7.7200782 × 108 |
| Variance | 247727.51 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 600 | 12356 | 2.1% |
| 700 | 9499 | 1.6% |
| 800 | 5837 | 1.0% |
| 630 | 3982 | 0.7% |
| 900 | 3604 | 0.6% |
| 1000 | 3547 | 0.6% |
| 830 | 3375 | 0.6% |
| 730 | 3292 | 0.6% |
| 1100 | 3288 | 0.6% |
| 615 | 3118 | 0.5% |
| Other values (1207) | 528060 |
| Value | Count | Frequency (%) |
| 1 | 17 | |
| 3 | 4 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 9 | 3 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 13 | 4 | < 0.1% |
| 14 | 21 |
| Value | Count | Frequency (%) |
| 2359 | 835 | |
| 2358 | 29 | < 0.1% |
| 2357 | 33 | < 0.1% |
| 2356 | 38 | < 0.1% |
| 2355 | 261 | < 0.1% |
| 2354 | 67 | < 0.1% |
| 2353 | 63 | < 0.1% |
| 2352 | 1 | < 0.1% |
| 2351 | 29 | < 0.1% |
| 2350 | 230 | < 0.1% |
DEP_TIME
Real number (ℝ)
| Distinct | 1398 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3186 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1333.2283 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 558 |
| Q1 | 907 |
| median | 1324 |
| Q3 | 1750 |
| 95-th percentile | 2148 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 843 |
Descriptive statistics
| Standard deviation | 512.42953 |
|---|---|
| Coefficient of variation (CV) | 0.38435242 |
| Kurtosis | -1.0203534 |
| Mean | 1333.2283 |
| Median Absolute Deviation (MAD) | 422 |
| Skewness | 0.04735793 |
| Sum | 7.6896875 × 108 |
| Variance | 262584.02 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 555 | 1617 | 0.3% |
| 557 | 1503 | 0.3% |
| 556 | 1467 | 0.3% |
| 558 | 1357 | 0.2% |
| 554 | 1335 | 0.2% |
| 655 | 1305 | 0.2% |
| 559 | 1244 | 0.2% |
| 553 | 1134 | 0.2% |
| 657 | 1126 | 0.2% |
| 654 | 1117 | 0.2% |
| Other values (1388) | 563567 | |
| (Missing) | 3186 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 77 | |
| 2 | 54 | |
| 3 | 60 | |
| 4 | 52 | |
| 5 | 63 | |
| 6 | 53 | |
| 7 | 40 | |
| 8 | 58 | |
| 9 | 46 | |
| 10 | 51 |
| Value | Count | Frequency (%) |
| 2400 | 56 | |
| 2359 | 106 | |
| 2358 | 87 | |
| 2357 | 87 | |
| 2356 | 101 | |
| 2355 | 125 | |
| 2354 | 138 | |
| 2353 | 138 | |
| 2352 | 105 | |
| 2351 | 128 |
DEP_DELAY
Real number (ℝ)
ZEROS 
| Distinct | 1066 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3188 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.032191 |
| Minimum | -59 |
|---|---|
| Maximum | 3238 |
| Zeros | 28184 |
| Zeros (%) | 4.9% |
| Negative | 339694 |
| Negative (%) | 58.6% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | -59 |
|---|---|
| 5-th percentile | -10 |
| Q1 | -5 |
| median | -2 |
| Q3 | 7 |
| 95-th percentile | 67 |
| Maximum | 3238 |
| Range | 3297 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 49.40699 |
|---|---|
| Coefficient of variation (CV) | 4.9248452 |
| Kurtosis | 347.43824 |
| Mean | 10.032191 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 13.279506 |
| Sum | 5786267 |
| Variance | 2441.0506 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -5 | 45300 | 7.8% |
| -4 | 42291 | 7.3% |
| -3 | 40862 | 7.0% |
| -2 | 37138 | 6.4% |
| -6 | 35800 | 6.2% |
| -1 | 32727 | 5.6% |
| -7 | 30101 | 5.2% |
| 0 | 28184 | 4.9% |
| -8 | 23333 | 4.0% |
| -9 | 17125 | 3.0% |
| Other values (1056) | 243909 |
| Value | Count | Frequency (%) |
| -59 | 1 | < 0.1% |
| -55 | 1 | < 0.1% |
| -42 | 1 | < 0.1% |
| -40 | 1 | < 0.1% |
| -39 | 1 | < 0.1% |
| -35 | 1 | < 0.1% |
| -34 | 3 | |
| -33 | 1 | < 0.1% |
| -32 | 2 | < 0.1% |
| -31 | 6 |
| Value | Count | Frequency (%) |
| 3238 | 1 | |
| 3221 | 1 | |
| 2895 | 1 | |
| 2884 | 1 | |
| 2682 | 1 | |
| 2499 | 1 | |
| 2233 | 1 | |
| 2065 | 1 | |
| 1995 | 1 | |
| 1804 | 1 |
TAXI_OUT
Real number (ℝ)
| Distinct | 156 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3278 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.781609 |
| Minimum | 1 |
|---|---|
| Maximum | 175 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 9 |
| Q1 | 12 |
| median | 15 |
| Q3 | 19 |
| 95-th percentile | 31 |
| Maximum | 175 |
| Range | 174 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 8.1883753 |
|---|---|
| Coefficient of variation (CV) | 0.48793745 |
| Kurtosis | 20.360387 |
| Mean | 16.781609 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 3.0645717 |
| Sum | 9677618 |
| Variance | 67.049491 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 48654 | 8.4% |
| 13 | 48325 | 8.3% |
| 11 | 45668 | 7.9% |
| 14 | 44668 | 7.7% |
| 15 | 41031 | 7.1% |
| 10 | 37393 | 6.4% |
| 16 | 35543 | 6.1% |
| 17 | 31045 | 5.4% |
| 18 | 26298 | 4.5% |
| 9 | 26090 | 4.5% |
| Other values (146) | 191965 |
| Value | Count | Frequency (%) |
| 1 | 11 | < 0.1% |
| 2 | 13 | < 0.1% |
| 3 | 88 | < 0.1% |
| 4 | 238 | < 0.1% |
| 5 | 639 | 0.1% |
| 6 | 2441 | 0.4% |
| 7 | 7184 | 1.2% |
| 8 | 15289 | |
| 9 | 26090 | |
| 10 | 37393 |
| Value | Count | Frequency (%) |
| 175 | 1 | |
| 173 | 1 | |
| 171 | 2 | |
| 167 | 1 | |
| 159 | 1 | |
| 158 | 1 | |
| 157 | 1 | |
| 156 | 1 | |
| 155 | 2 | |
| 154 | 1 |
TAXI_IN
Real number (ℝ)
| Distinct | 146 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3387 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.0212203 |
| Minimum | 1 |
|---|---|
| Maximum | 168 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 4 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 19 |
| Maximum | 168 |
| Range | 167 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 6.5468197 |
|---|---|
| Coefficient of variation (CV) | 0.8161875 |
| Kurtosis | 41.222953 |
| Mean | 8.0212203 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.5675662 |
| Sum | 4624803 |
| Variance | 42.860848 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 87117 | |
| 5 | 81748 | |
| 6 | 67404 | |
| 7 | 55193 | |
| 3 | 53859 | |
| 8 | 42259 | |
| 9 | 33164 | 5.7% |
| 10 | 25719 | 4.4% |
| 11 | 19873 | 3.4% |
| 12 | 15907 | 2.7% |
| Other values (136) | 94328 |
| Value | Count | Frequency (%) |
| 1 | 963 | 0.2% |
| 2 | 14530 | 2.5% |
| 3 | 53859 | |
| 4 | 87117 | |
| 5 | 81748 | |
| 6 | 67404 | |
| 7 | 55193 | |
| 8 | 42259 | |
| 9 | 33164 | 5.7% |
| 10 | 25719 | 4.4% |
| Value | Count | Frequency (%) |
| 168 | 2 | |
| 164 | 1 | |
| 163 | 1 | |
| 158 | 1 | |
| 152 | 1 | |
| 151 | 1 | |
| 149 | 1 | |
| 145 | 1 | |
| 141 | 1 | |
| 139 | 2 |
CRS_ARR_TIME
Real number (ℝ)
| Distinct | 1299 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1485.5473 |
| Minimum | 1 |
|---|---|
| Maximum | 2359 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 715 |
| Q1 | 1056 |
| median | 1513 |
| Q3 | 1927 |
| 95-th percentile | 2301 |
| Maximum | 2359 |
| Range | 2358 |
| Interquartile range (IQR) | 871 |
Descriptive statistics
| Standard deviation | 527.21622 |
|---|---|
| Coefficient of variation (CV) | 0.35489697 |
| Kurtosis | -0.51553169 |
| Mean | 1485.5473 |
| Median Absolute Deviation (MAD) | 417 |
| Skewness | -0.27911186 |
| Sum | 8.6155503 × 108 |
| Variance | 277956.94 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2359 | 3343 | 0.6% |
| 2100 | 2183 | 0.4% |
| 1855 | 1886 | 0.3% |
| 1810 | 1877 | 0.3% |
| 2200 | 1855 | 0.3% |
| 900 | 1748 | 0.3% |
| 2000 | 1742 | 0.3% |
| 1100 | 1667 | 0.3% |
| 1940 | 1658 | 0.3% |
| 1845 | 1643 | 0.3% |
| Other values (1289) | 560356 |
| Value | Count | Frequency (%) |
| 1 | 33 | < 0.1% |
| 2 | 7 | < 0.1% |
| 3 | 26 | < 0.1% |
| 4 | 89 | < 0.1% |
| 5 | 749 | |
| 6 | 103 | < 0.1% |
| 7 | 136 | < 0.1% |
| 8 | 108 | < 0.1% |
| 9 | 107 | < 0.1% |
| 10 | 421 |
| Value | Count | Frequency (%) |
| 2359 | 3343 | |
| 2358 | 773 | 0.1% |
| 2357 | 1185 | 0.2% |
| 2356 | 519 | 0.1% |
| 2355 | 1124 | 0.2% |
| 2354 | 426 | 0.1% |
| 2353 | 439 | 0.1% |
| 2352 | 433 | 0.1% |
| 2351 | 405 | 0.1% |
| 2350 | 945 | 0.2% |
ARR_TIME
Real number (ℝ)
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3387 |
| Missing (%) | 0.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1454.0404 |
| Minimum | 1 |
|---|---|
| Maximum | 2400 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 629 |
| Q1 | 1035 |
| median | 1456 |
| Q3 | 1918 |
| 95-th percentile | 2255 |
| Maximum | 2400 |
| Range | 2399 |
| Interquartile range (IQR) | 883 |
Descriptive statistics
| Standard deviation | 551.37251 |
|---|---|
| Coefficient of variation (CV) | 0.37920026 |
| Kurtosis | -0.42530542 |
| Mean | 1454.0404 |
| Median Absolute Deviation (MAD) | 440 |
| Skewness | -0.35410675 |
| Sum | 8.3835754 × 108 |
| Variance | 304011.64 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1849 | 656 | 0.1% |
| 1845 | 650 | 0.1% |
| 1638 | 637 | 0.1% |
| 1000 | 634 | 0.1% |
| 1142 | 633 | 0.1% |
| 1646 | 629 | 0.1% |
| 2137 | 623 | 0.1% |
| 927 | 622 | 0.1% |
| 1150 | 622 | 0.1% |
| 1633 | 620 | 0.1% |
| Other values (1430) | 570245 | |
| (Missing) | 3387 | 0.6% |
| Value | Count | Frequency (%) |
| 1 | 382 | |
| 2 | 360 | |
| 3 | 346 | |
| 4 | 355 | |
| 5 | 331 | |
| 6 | 333 | |
| 7 | 312 | |
| 8 | 316 | |
| 9 | 304 | |
| 10 | 330 |
| Value | Count | Frequency (%) |
| 2400 | 313 | |
| 2359 | 384 | |
| 2358 | 425 | |
| 2357 | 434 | |
| 2356 | 401 | |
| 2355 | 424 | |
| 2354 | 400 | |
| 2353 | 457 | |
| 2352 | 466 | |
| 2351 | 441 |
ARR_DELAY
Real number (ℝ)
ZEROS 
| Distinct | 1099 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 4529 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.8132541 |
| Minimum | -84 |
|---|---|
| Maximum | 3241 |
| Zeros | 10858 |
| Zeros (%) | 1.9% |
| Negative | 369928 |
| Negative (%) | 63.8% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | -84 |
|---|---|
| 5-th percentile | -27 |
| Q1 | -15 |
| median | -7 |
| Q3 | 7 |
| 95-th percentile | 65 |
| Maximum | 3241 |
| Range | 3325 |
| Interquartile range (IQR) | 22 |
Descriptive statistics
| Standard deviation | 51.106806 |
|---|---|
| Coefficient of variation (CV) | 13.402413 |
| Kurtosis | 307.21373 |
| Mean | 3.8132541 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 12.082205 |
| Sum | 2194257 |
| Variance | 2611.9056 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -11 | 17424 | 3.0% |
| -10 | 17140 | 3.0% |
| -12 | 17136 | 3.0% |
| -9 | 16873 | 2.9% |
| -13 | 16736 | 2.9% |
| -8 | 16585 | 2.9% |
| -14 | 16357 | 2.8% |
| -7 | 15756 | 2.7% |
| -15 | 15385 | 2.7% |
| -6 | 15351 | 2.6% |
| Other values (1089) | 410686 |
| Value | Count | Frequency (%) |
| -84 | 1 | < 0.1% |
| -82 | 1 | < 0.1% |
| -80 | 1 | < 0.1% |
| -79 | 2 | < 0.1% |
| -77 | 1 | < 0.1% |
| -76 | 3 | |
| -75 | 2 | < 0.1% |
| -74 | 2 | < 0.1% |
| -73 | 5 | |
| -72 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3241 | 1 | |
| 3237 | 1 | |
| 2900 | 2 | |
| 2682 | 1 | |
| 2499 | 1 | |
| 2260 | 1 | |
| 2072 | 1 | |
| 2006 | 1 | |
| 1832 | 1 | |
| 1790 | 1 |
CANCELLED
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
| 0.0 | |
|---|---|
| 1.0 | 3310 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1739874 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 576648 | |
| 1.0 | 3310 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 576648 | |
| 1.0 | 3310 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1156606 | |
| . | 579958 | |
| 1 | 3310 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1156606 | |
| . | 579958 | |
| 1 | 3310 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1156606 | |
| . | 579958 | |
| 1 | 3310 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1156606 | |
| . | 579958 | |
| 1 | 3310 | 0.2% |
CANCELLATION_CODE
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 576648 |
| Missing (%) | 99.4% |
| Memory size | 4.4 MiB |
| A | |
|---|---|
| B | |
| C | |
| D | 4 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3310 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 1587 | 0.3% |
| B | 1527 | 0.3% |
| C | 192 | < 0.1% |
| D | 4 | < 0.1% |
| (Missing) | 576648 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 1587 | |
| b | 1527 | |
| c | 192 | 5.8% |
| d | 4 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1587 | |
| B | 1527 | |
| C | 192 | 5.8% |
| D | 4 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3310 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 1587 | |
| B | 1527 | |
| C | 192 | 5.8% |
| D | 4 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3310 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 1587 | |
| B | 1527 | |
| C | 192 | 5.8% |
| D | 4 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3310 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 1587 | |
| B | 1527 | |
| C | 192 | 5.8% |
| D | 4 | 0.1% |
DIVERTED
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.4 MiB |
| 0.0 | |
|---|---|
| 1.0 | 1218 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1739874 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 578740 | |
| 1.0 | 1218 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 578740 | |
| 1.0 | 1218 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1158698 | |
| . | 579958 | |
| 1 | 1218 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1158698 | |
| . | 579958 | |
| 1 | 1218 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1158698 | |
| . | 579958 | |
| 1 | 1218 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1739874 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1158698 | |
| . | 579958 | |
| 1 | 1218 | 0.1% |
AIR_TIME
Real number (ℝ)
| Distinct | 608 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 4529 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 113.73129 |
| Minimum | 8 |
|---|---|
| Maximum | 648 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 62 |
| median | 96 |
| Q3 | 143 |
| 95-th percentile | 272 |
| Maximum | 648 |
| Range | 640 |
| Interquartile range (IQR) | 81 |
Descriptive statistics
| Standard deviation | 70.123149 |
|---|---|
| Coefficient of variation (CV) | 0.61656863 |
| Kurtosis | 2.3084628 |
| Mean | 113.73129 |
| Median Absolute Deviation (MAD) | 38 |
| Skewness | 1.402298 |
| Sum | 65444285 |
| Variance | 4917.256 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 63 | 5102 | 0.9% |
| 65 | 5101 | 0.9% |
| 64 | 5065 | 0.9% |
| 62 | 4933 | 0.9% |
| 61 | 4874 | 0.8% |
| 66 | 4847 | 0.8% |
| 67 | 4810 | 0.8% |
| 53 | 4791 | 0.8% |
| 54 | 4770 | 0.8% |
| 60 | 4769 | 0.8% |
| Other values (598) | 526367 |
| Value | Count | Frequency (%) |
| 8 | 5 | < 0.1% |
| 9 | 19 | < 0.1% |
| 10 | 25 | < 0.1% |
| 11 | 15 | < 0.1% |
| 12 | 8 | < 0.1% |
| 13 | 12 | < 0.1% |
| 14 | 11 | < 0.1% |
| 15 | 31 | < 0.1% |
| 16 | 104 | |
| 17 | 190 |
| Value | Count | Frequency (%) |
| 648 | 1 | |
| 646 | 1 | |
| 645 | 1 | |
| 639 | 1 | |
| 636 | 1 | |
| 632 | 1 | |
| 631 | 1 | |
| 630 | 1 | |
| 625 | 2 | |
| 624 | 1 |
CARRIER_DELAY
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 844 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 475839 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.089263 |
| Minimum | 0 |
|---|---|
| Maximum | 3221 |
| Zeros | 41858 |
| Zeros (%) | 7.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 6 |
| Q3 | 23 |
| 95-th percentile | 101 |
| Maximum | 3221 |
| Range | 3221 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 73.84996 |
|---|---|
| Coefficient of variation (CV) | 2.9434886 |
| Kurtosis | 207.27696 |
| Mean | 25.089263 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 11.057327 |
| Sum | 2612269 |
| Variance | 5453.8167 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 41858 | 7.2% |
| 2 | 2085 | 0.4% |
| 1 | 2055 | 0.4% |
| 6 | 1951 | 0.3% |
| 3 | 1933 | 0.3% |
| 15 | 1919 | 0.3% |
| 4 | 1896 | 0.3% |
| 7 | 1844 | 0.3% |
| 5 | 1828 | 0.3% |
| 16 | 1752 | 0.3% |
| Other values (834) | 44998 | 7.8% |
| (Missing) | 475839 |
| Value | Count | Frequency (%) |
| 0 | 41858 | |
| 1 | 2055 | 0.4% |
| 2 | 2085 | 0.4% |
| 3 | 1933 | 0.3% |
| 4 | 1896 | 0.3% |
| 5 | 1828 | 0.3% |
| 6 | 1951 | 0.3% |
| 7 | 1844 | 0.3% |
| 8 | 1719 | 0.3% |
| 9 | 1653 | 0.3% |
| Value | Count | Frequency (%) |
| 3221 | 1 | |
| 2884 | 1 | |
| 2682 | 1 | |
| 2499 | 1 | |
| 2233 | 1 | |
| 1804 | 1 | |
| 1782 | 1 | |
| 1766 | 1 | |
| 1754 | 1 | |
| 1664 | 1 |
WEATHER_DELAY
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 349 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 475839 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8303672 |
| Minimum | 0 |
|---|---|
| Maximum | 1439 |
| Zeros | 99571 |
| Zeros (%) | 17.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1439 |
| Range | 1439 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 24.373939 |
|---|---|
| Coefficient of variation (CV) | 8.611582 |
| Kurtosis | 745.43133 |
| Mean | 2.8303672 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.112416 |
| Sum | 294695 |
| Variance | 594.0889 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 99571 | 17.2% |
| 17 | 86 | < 0.1% |
| 6 | 81 | < 0.1% |
| 19 | 81 | < 0.1% |
| 16 | 81 | < 0.1% |
| 5 | 74 | < 0.1% |
| 18 | 73 | < 0.1% |
| 13 | 72 | < 0.1% |
| 8 | 69 | < 0.1% |
| 15 | 69 | < 0.1% |
| Other values (339) | 3862 | 0.7% |
| (Missing) | 475839 |
| Value | Count | Frequency (%) |
| 0 | 99571 | |
| 1 | 60 | < 0.1% |
| 2 | 53 | < 0.1% |
| 3 | 57 | < 0.1% |
| 4 | 46 | < 0.1% |
| 5 | 74 | < 0.1% |
| 6 | 81 | < 0.1% |
| 7 | 62 | < 0.1% |
| 8 | 69 | < 0.1% |
| 9 | 66 | < 0.1% |
| Value | Count | Frequency (%) |
| 1439 | 1 | |
| 1393 | 1 | |
| 1188 | 1 | |
| 942 | 1 | |
| 937 | 1 | |
| 934 | 1 | |
| 927 | 1 | |
| 920 | 1 | |
| 919 | 1 | |
| 905 | 1 |
NAS_DELAY
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 330 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 475839 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.469444 |
| Minimum | 0 |
|---|---|
| Maximum | 1515 |
| Zeros | 57455 |
| Zeros (%) | 9.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 14 |
| 95-th percentile | 45 |
| Maximum | 1515 |
| Range | 1515 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 26.456887 |
|---|---|
| Coefficient of variation (CV) | 2.5270576 |
| Kurtosis | 387.26735 |
| Mean | 10.469444 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 12.826572 |
| Sum | 1090068 |
| Variance | 699.96689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 57455 | 9.9% |
| 1 | 2686 | 0.5% |
| 2 | 1980 | 0.3% |
| 15 | 1894 | 0.3% |
| 3 | 1845 | 0.3% |
| 4 | 1810 | 0.3% |
| 16 | 1713 | 0.3% |
| 5 | 1614 | 0.3% |
| 6 | 1576 | 0.3% |
| 17 | 1534 | 0.3% |
| Other values (320) | 30012 | 5.2% |
| (Missing) | 475839 |
| Value | Count | Frequency (%) |
| 0 | 57455 | |
| 1 | 2686 | 0.5% |
| 2 | 1980 | 0.3% |
| 3 | 1845 | 0.3% |
| 4 | 1810 | 0.3% |
| 5 | 1614 | 0.3% |
| 6 | 1576 | 0.3% |
| 7 | 1487 | 0.3% |
| 8 | 1373 | 0.2% |
| 9 | 1291 | 0.2% |
| Value | Count | Frequency (%) |
| 1515 | 1 | |
| 1421 | 1 | |
| 1065 | 1 | |
| 975 | 1 | |
| 905 | 1 | |
| 883 | 1 | |
| 857 | 1 | |
| 831 | 1 | |
| 811 | 1 | |
| 800 | 1 |
SECURITY_DELAY
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 95 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 475839 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.16002843 |
| Minimum | 0 |
|---|---|
| Maximum | 1183 |
| Zeros | 103517 |
| Zeros (%) | 17.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1183 |
| Range | 1183 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 4.8834056 |
|---|---|
| Coefficient of variation (CV) | 30.515863 |
| Kurtosis | 33823.228 |
| Mean | 0.16002843 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 151.31394 |
| Sum | 16662 |
| Variance | 23.84765 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 103517 | 17.8% |
| 10 | 30 | < 0.1% |
| 15 | 26 | < 0.1% |
| 17 | 25 | < 0.1% |
| 8 | 24 | < 0.1% |
| 12 | 23 | < 0.1% |
| 13 | 21 | < 0.1% |
| 16 | 20 | < 0.1% |
| 18 | 19 | < 0.1% |
| 19 | 18 | < 0.1% |
| Other values (85) | 396 | 0.1% |
| (Missing) | 475839 |
| Value | Count | Frequency (%) |
| 0 | 103517 | |
| 1 | 13 | < 0.1% |
| 2 | 14 | < 0.1% |
| 3 | 17 | < 0.1% |
| 4 | 13 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 15 | < 0.1% |
| 7 | 17 | < 0.1% |
| 8 | 24 | < 0.1% |
| 9 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 1183 | 1 | |
| 371 | 1 | |
| 289 | 1 | |
| 277 | 1 | |
| 272 | 1 | |
| 223 | 1 | |
| 196 | 1 | |
| 146 | 1 | |
| 137 | 2 | |
| 133 | 1 |
LATE_AIRCRAFT_DELAY
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 664 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 475839 |
| Missing (%) | 82.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.393655 |
| Minimum | 0 |
|---|---|
| Maximum | 3228 |
| Zeros | 50580 |
| Zeros (%) | 8.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 3 |
| Q3 | 30 |
| 95-th percentile | 115 |
| Maximum | 3228 |
| Range | 3228 |
| Interquartile range (IQR) | 30 |
Descriptive statistics
| Standard deviation | 61.088421 |
|---|---|
| Coefficient of variation (CV) | 2.3145116 |
| Kurtosis | 187.81024 |
| Mean | 26.393655 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 9.1429102 |
| Sum | 2748081 |
| Variance | 3731.7952 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 50580 | 8.7% |
| 15 | 1361 | 0.2% |
| 16 | 1332 | 0.2% |
| 17 | 1260 | 0.2% |
| 18 | 1174 | 0.2% |
| 19 | 1103 | 0.2% |
| 20 | 1096 | 0.2% |
| 21 | 1024 | 0.2% |
| 14 | 1001 | 0.2% |
| 11 | 989 | 0.2% |
| Other values (654) | 43199 | 7.4% |
| (Missing) | 475839 |
| Value | Count | Frequency (%) |
| 0 | 50580 | |
| 1 | 711 | 0.1% |
| 2 | 738 | 0.1% |
| 3 | 720 | 0.1% |
| 4 | 778 | 0.1% |
| 5 | 779 | 0.1% |
| 6 | 888 | 0.2% |
| 7 | 850 | 0.1% |
| 8 | 908 | 0.2% |
| 9 | 868 | 0.1% |
| Value | Count | Frequency (%) |
| 3228 | 1 | |
| 2065 | 1 | |
| 1970 | 1 | |
| 1664 | 1 | |
| 1557 | 1 | |
| 1525 | 1 | |
| 1434 | 1 | |
| 1421 | 1 | |
| 1380 | 1 | |
| 1353 | 1 |
| DAY_OF_WEEK | FL_DATE | OP_UNIQUE_CARRIER | OP_CARRIER_FL_NUM | ORIGIN_AIRPORT_ID | ORIGIN | ORIGIN_CITY_NAME | ORIGIN_STATE_NM | ORIGIN_WAC | DEST_AIRPORT_ID | DEST | DEST_CITY_NAME | DEST_STATE_NM | DEST_WAC | CRS_DEP_TIME | DEP_TIME | DEP_DELAY | TAXI_OUT | TAXI_IN | CRS_ARR_TIME | ARR_TIME | ARR_DELAY | CANCELLED | CANCELLATION_CODE | DIVERTED | AIR_TIME | CARRIER_DELAY | WEATHER_DELAY | NAS_DELAY | SECURITY_DELAY | LATE_AIRCRAFT_DELAY | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 5/1/2023 12:00:00 AM | 9E | 4628 | 14576 | ROC | Rochester, NY | New York | 22 | 12953 | LGA | New York, NY | New York | 22 | 1000 | 955.0 | -5.0 | 12.0 | 33.0 | 1122 | 1128.0 | 6.0 | 0.0 | NaN | 0.0 | 48.0 | NaN | NaN | NaN | NaN | NaN |
| 1 | 1 | 5/1/2023 12:00:00 AM | 9E | 4629 | 12397 | ITH | Ithaca/Cortland, NY | New York | 22 | 12478 | JFK | New York, NY | New York | 22 | 1520 | 1605.0 | 45.0 | 13.0 | 9.0 | 1629 | 1712.0 | 43.0 | 0.0 | NaN | 0.0 | 45.0 | 0.0 | 0.0 | 0.0 | 0.0 | 43.0 |
| 2 | 1 | 5/1/2023 12:00:00 AM | 9E | 4630 | 11042 | CLE | Cleveland, OH | Ohio | 44 | 12478 | JFK | New York, NY | New York | 22 | 1644 | 1642.0 | -2.0 | 10.0 | 8.0 | 1829 | 1811.0 | -18.0 | 0.0 | NaN | 0.0 | 71.0 | NaN | NaN | NaN | NaN | NaN |
| 3 | 1 | 5/1/2023 12:00:00 AM | 9E | 4631 | 12264 | IAD | Washington, DC | Virginia | 38 | 12953 | LGA | New York, NY | New York | 22 | 1805 | 1849.0 | 44.0 | 14.0 | 7.0 | 1929 | 1958.0 | 29.0 | 0.0 | NaN | 0.0 | 48.0 | 0.0 | 0.0 | 0.0 | 0.0 | 29.0 |
| 4 | 1 | 5/1/2023 12:00:00 AM | 9E | 4632 | 12478 | JFK | New York, NY | New York | 22 | 11042 | CLE | Cleveland, OH | Ohio | 44 | 1458 | 1453.0 | -5.0 | 28.0 | 10.0 | 1704 | 1651.0 | -13.0 | 0.0 | NaN | 0.0 | 80.0 | NaN | NaN | NaN | NaN | NaN |
| 5 | 1 | 5/1/2023 12:00:00 AM | 9E | 4633 | 11042 | CLE | Cleveland, OH | Ohio | 44 | 12953 | LGA | New York, NY | New York | 22 | 1750 | 1747.0 | -3.0 | 18.0 | 6.0 | 1925 | 1924.0 | -1.0 | 0.0 | NaN | 0.0 | 73.0 | NaN | NaN | NaN | NaN | NaN |
| 6 | 1 | 5/1/2023 12:00:00 AM | 9E | 4634 | 12953 | LGA | New York, NY | New York | 22 | 11042 | CLE | Cleveland, OH | Ohio | 44 | 1250 | 1259.0 | 9.0 | 24.0 | 17.0 | 1441 | 1447.0 | 6.0 | 0.0 | NaN | 0.0 | 67.0 | NaN | NaN | NaN | NaN | NaN |
| 7 | 1 | 5/1/2023 12:00:00 AM | 9E | 4635 | 12478 | JFK | New York, NY | New York | 22 | 10821 | BWI | Baltimore, MD | Maryland | 35 | 1455 | 1453.0 | -2.0 | 25.0 | 6.0 | 1625 | 1601.0 | -24.0 | 0.0 | NaN | 0.0 | 37.0 | NaN | NaN | NaN | NaN | NaN |
| 8 | 1 | 5/1/2023 12:00:00 AM | 9E | 4636 | 10423 | AUS | Austin, TX | Texas | 74 | 14492 | RDU | Raleigh/Durham, NC | North Carolina | 36 | 1000 | 950.0 | -10.0 | 11.0 | 3.0 | 1352 | 1318.0 | -34.0 | 0.0 | NaN | 0.0 | 134.0 | NaN | NaN | NaN | NaN | NaN |
| 9 | 1 | 5/1/2023 12:00:00 AM | 9E | 4638 | 12953 | LGA | New York, NY | New York | 22 | 13244 | MEM | Memphis, TN | Tennessee | 54 | 859 | 1132.0 | 153.0 | 29.0 | 6.0 | 1102 | 1326.0 | 144.0 | 0.0 | NaN | 0.0 | 139.0 | 144.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| DAY_OF_WEEK | FL_DATE | OP_UNIQUE_CARRIER | OP_CARRIER_FL_NUM | ORIGIN_AIRPORT_ID | ORIGIN | ORIGIN_CITY_NAME | ORIGIN_STATE_NM | ORIGIN_WAC | DEST_AIRPORT_ID | DEST | DEST_CITY_NAME | DEST_STATE_NM | DEST_WAC | CRS_DEP_TIME | DEP_TIME | DEP_DELAY | TAXI_OUT | TAXI_IN | CRS_ARR_TIME | ARR_TIME | ARR_DELAY | CANCELLED | CANCELLATION_CODE | DIVERTED | AIR_TIME | CARRIER_DELAY | WEATHER_DELAY | NAS_DELAY | SECURITY_DELAY | LATE_AIRCRAFT_DELAY | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 579948 | 7 | 5/28/2023 12:00:00 AM | YX | 5844 | 11278 | DCA | Washington, DC | Virginia | 38 | 12953 | LGA | New York, NY | New York | 22 | 1800 | 1754.0 | -6.0 | 8.0 | 7.0 | 1940 | 1854.0 | -46.0 | 0.0 | NaN | 0.0 | 45.0 | NaN | NaN | NaN | NaN | NaN |
| 579949 | 7 | 5/28/2023 12:00:00 AM | YX | 5846 | 10721 | BOS | Boston, MA | Massachusetts | 13 | 11278 | DCA | Washington, DC | Virginia | 38 | 1825 | 1822.0 | -3.0 | 19.0 | 2.0 | 2020 | 1951.0 | -29.0 | 0.0 | NaN | 0.0 | 68.0 | NaN | NaN | NaN | NaN | NaN |
| 579950 | 7 | 5/28/2023 12:00:00 AM | YX | 5848 | 10721 | BOS | Boston, MA | Massachusetts | 13 | 12478 | JFK | New York, NY | New York | 22 | 906 | 902.0 | -4.0 | 18.0 | 7.0 | 1036 | 1009.0 | -27.0 | 0.0 | NaN | 0.0 | 42.0 | NaN | NaN | NaN | NaN | NaN |
| 579951 | 7 | 5/28/2023 12:00:00 AM | YX | 5852 | 10721 | BOS | Boston, MA | Massachusetts | 13 | 14100 | PHL | Philadelphia, PA | Pennsylvania | 23 | 1925 | 1915.0 | -10.0 | 17.0 | 5.0 | 2106 | 2028.0 | -38.0 | 0.0 | NaN | 0.0 | 51.0 | NaN | NaN | NaN | NaN | NaN |
| 579952 | 7 | 5/28/2023 12:00:00 AM | YX | 5853 | 12478 | JFK | New York, NY | New York | 22 | 10721 | BOS | Boston, MA | Massachusetts | 13 | 600 | 558.0 | -2.0 | 12.0 | 7.0 | 720 | 700.0 | -20.0 | 0.0 | NaN | 0.0 | 43.0 | NaN | NaN | NaN | NaN | NaN |
| 579953 | 7 | 5/28/2023 12:00:00 AM | YX | 5854 | 10693 | BNA | Nashville, TN | Tennessee | 54 | 12953 | LGA | New York, NY | New York | 22 | 1253 | 1245.0 | -8.0 | 16.0 | 10.0 | 1621 | 1619.0 | -2.0 | 0.0 | NaN | 0.0 | 128.0 | NaN | NaN | NaN | NaN | NaN |
| 579954 | 7 | 5/28/2023 12:00:00 AM | YX | 5854 | 12953 | LGA | New York, NY | New York | 22 | 10693 | BNA | Nashville, TN | Tennessee | 54 | 1024 | 1015.0 | -9.0 | 13.0 | 3.0 | 1208 | 1114.0 | -54.0 | 0.0 | NaN | 0.0 | 103.0 | NaN | NaN | NaN | NaN | NaN |
| 579955 | 7 | 5/28/2023 12:00:00 AM | YX | 5855 | 14730 | SDF | Louisville, KY | Kentucky | 52 | 12953 | LGA | New York, NY | New York | 22 | 1236 | 1235.0 | -1.0 | 14.0 | 7.0 | 1452 | 1445.0 | -7.0 | 0.0 | NaN | 0.0 | 109.0 | NaN | NaN | NaN | NaN | NaN |
| 579956 | 7 | 5/28/2023 12:00:00 AM | YX | 5857 | 14524 | RIC | Richmond, VA | Virginia | 38 | 12953 | LGA | New York, NY | New York | 22 | 1942 | 2119.0 | 97.0 | 12.0 | 6.0 | 2112 | 2228.0 | 76.0 | 0.0 | NaN | 0.0 | 51.0 | 76.0 | 0.0 | 0.0 | 0.0 | 0.0 |
| 579957 | 7 | 5/28/2023 12:00:00 AM | YX | 5861 | 10721 | BOS | Boston, MA | Massachusetts | 13 | 11278 | DCA | Washington, DC | Virginia | 38 | 1019 | 1012.0 | -7.0 | 16.0 | 6.0 | 1204 | 1138.0 | -26.0 | 0.0 | NaN | 0.0 | 64.0 | NaN | NaN | NaN | NaN | NaN |